HUI-Audio-Corpus-German: A High Quality TTS Dataset

Abstract

The increasing availability of audio data on the internet has led to a multitude of datasets for the development and training of text-to-speech applications based on deep neural networks. Highly differing voice quality, low sampling rates, a lack of text normalization, and disadvantageous alignment of audio samples to corresponding transcript sentences still limit the performance of deep neural networks trained on this task. Additionally, resources in languages like German are very limited. We introduce “HUI-Audio-Corpus-German”, a large, open-source dataset for TTS engines, created with a processing pipeline that produces high-quality audio-to-transcription alignments and decreases the manual effort needed for creation.

Similar resources

French Learners Audio Corpus of German Speech (FLACGS)

The French Learners Audio Corpus of German Speech (FLACGS) was created to compare German speech production of German native speakers (GG) and French learners of German (FG) across three speech production tasks of increasing production complexity: repetition, reading and picture description. 40 speakers, 20 GG and 20 FG performed each of the three tasks, which in total leads to approximately 7h ...

High quality TTS voices within one day

State-of-the-art unit-selection text-to-speech systems currently produce very natural synthetic speech, but at the price of a costly and time-consuming voice creation process. We report here an extensive perceptual evaluation of several voice creation strategies, and conclude with a novel one-day process giving access to high quality TTS voices.

A High-Quality Web Corpus of Czech

In our paper, we present the main results of the Czech grant project Internet as a Language Corpus, whose aim was to build a corpus of Czech web texts and to develop and publicly release related software tools. Our corpus may not be the largest web corpus of Czech, but it maintains very good language quality due to the high proportion of human work involved in the corpus development process. We describe t...

Coding High Quality Digital Audio

The author has led the campaign mounted by Acoustic Renaissance for Audio of which he is Chairman. The ARA has made consistent arguments for higher standards of audio recording quality on the next major format (which should be DVD Audio). This article is an adaptation of material that the author has presented during this campaign. It summarises some of the important issues we face in deciding h...

High-Quality Low-Voltage Audio

Yet the audio quality on most devices is inferior. Typical signal-to-noise ratios measure in the 60 to 70 dB range (re –20 dBFS), which is about 5-10 dB worse noise than good portable CD players. They tend to suffer from non-flat frequency responses (bass and treble boost being the biggest offenders) with limited low frequency response (typically rolled off beginning at 60 Hz apparently to comp...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-87626-5_15